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(54) ABR flow control using single bit congestion indication and wavelet Transform filtering 



(57) A flow control system and methodology con- 
trols and avoids congestion in an Asynchronous Trans- 
fer Mode (ATM) network. The congestion state of a net- 
work switch is determined based on incoming Available 
Bit Rate (ABR) traffic, Constant Bit Rate (CBR) traffic, 
and Variable Bit Rate (VBR) traffic. High-frequency fluc- 
tuations due to VBR traffic are filtered out of the CBR/ 
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Wavelet-based 
Processor 



VBR traffic to determine the underlying trend of network 
traffic. A filtering mechanism is based on wavelet trans- 
forms (164). The switch congestion state is determined 
based on the current ABR traffic, the filtered CBR/VBR 
traffic, and the current status of buffers (166,170) in the 
switch. The congestion state can then be used to adjust 
the ABR rate according to a feedback scheme such as 
Explicit Forward Congestion Indication (EFCl). 
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Description 

Background of the Invention 

5 [0001] The present invention relates to congestion control and avoidance in a communications network and, more 
particularly, to controlling and avoiding congestion in an Asynchronous Transfer Mode (ATM) network using a wavelet- 
based filtering mechanism in an Available Bit Rate (ABR) Explicit Forward Congestion Indication (EFCI) flow control 
solution. 

[0002] The ATM Forum Traffic Management Specification, Version 4.0, 1 996 (TM4.0), currently defines five service 
10 classes to support the diverse requirements of multimedia traffic: Constant Bit Rate (CBR), Real-Time Variable Bit Rate 
(RT-VBR), Non-Real-Time Variable Bit Rate (NRT-VBR), Available Bit Rate (ABR), and Unspecified Bit Rate (UBR). 
Both CBR and RT-VBR service classes provide stringent cell transmission delay (CTD), cell delay variation (CDV), 
and cell loss ratio (CLR) constraints. NRT-VBR service provides stringent CTD and CLR constraints but no CDV guar- 
antees. CBR, RT-VBR, and NRT-VBR all rely on the network reservation of resources to provide a required quality of 
75 service (QoS). 

[0003] ABR and UBR service classes are defined for applications with unpredictable bandwidth requirements, that 
are sensitive to cell loss, but are able to tolerate a certain amount of delay. ABR and UBR service classes are designed 
so that applications in these classes can grab any unused network resources that VBR and CBR traffic does not utilize, 
i.e., unused bandwidth and buffer space. Gains due to statistical resource utilization, however, come at the risk of 
20 potential congestion when many applications compete for network resources. Therefore, proper congestion control 
must be in place to ensure that network resources can be shared in a fair manner and that performance objectives 
such as cell loss ratio can be maintained. 

[0004] UBR service is truly "best-effort," and thus the network provides no service guarantees for UBR. ABR, how- 
ever, provides some service guarantees. Therefore, ABR must comply with the flow control framework specified in 

25 TM4.0, which supports several types of feedback to control the source rate in response to changing transfer charac- 
teristics. This feedback information is conveyed to the source, which adapts its traffic in accordance with the feedback. 
The feedback information includes the state of congestion and a fair share of the available bandwidth according to a 
network-specific allocation policy. To ensure interoperability, an ABR end system must always implement the source 
and destination behavior defined in TM4.0. In particular, the cell rate from an ABR source is bounded by a Peak Cell 

30 Rate (PCR) and Minimum Cell Rate (MCR) and must abide by a Cell Delay variation Tolerance (CDVT). 

[0005] The ABR congestion control scheme specified in TM4.0 is a rate-based, closed-loop, per-connection control 
that utilizes the feedback information from the network to regulate the rate of cell transmission at the source. Figure 1 
illustrates the basic operation of ABR congestion control. In network 100, source 102 transmits data cells 108 to des- 
tination 104 via one or more switches in network 100, one of which is shown as switch 106 in Figure 1. Source 102 

35 also generates special probe cells 110 referred to as resource management (RM) cells in proportion to its current data 
cell rate. Destination 104 turns around and sends RM cells 110 back to source 102 in the other direction. The RM cells, 
which can be examined and modified by the switches in both forward and backward directions, carry feedback infor- 
mation of the state of congestion and the fair rate allocation. The following summarizes the operation of the rate-based 
control scheme, the details of which, along with the detailed RM cell format, are found in TM4.0, which is incorporated 

40 herein by reference. 

[0006] A switch shall implement at least one of the following methods to control congestion: 

a) Explicit Forward Congestion Indication (EFCI) marking: The ATM switch may set the EFCI state in the data cell 
headers. Most first-generation ATM switches implemented this mechanism even before the RM cell was fully de- 

45 fined. 

b) Relative rate marking: The ATM switch may set the congestion indication (CI) bit or the no increase (Nl) bit in 
the forward and/or backward RM cells. 

c) Explicit rate marking: The ATM switch may reduce the explicit rate (ER) field of forward and/or backward RM cells. 

so [0007] Switches that implement EFCI marking and relative rate marking are known as binary switches, which are 
simple to implement but may result in unfairness, congestion oscillation, and slow congestion response. Switches that 
implement explicit rate marking are generally referred to as ER switches and require sophisticated mechanisms at the 
switches to compute a fair share of the bandwidth. The TM4.0 standard-defined source and destination behaviors, 
however, allow interoperation of all three congestion control options according to the following rules. 

55 [0008] Once the source has received permission, it begins scheduling cells for transmission at the allowed cell rate 
(ACR). The ACR is initially set to the initial cell rate (ICR) and is always bounded between the minimum cell rate (MCR) 
and the peak cell rate (PCR) specified by the application at call setup. An RM cell precedes transmission of any data 
cells. The source continues to send RM cells, typically after every Nrm data cells, where Nrm is the maximum number 
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of data cells a source may send for each forward RM cell. The source places the ACR value in the current ceil rate 
(CCR) field of the RM cell, and the rate at which it wishes to transmit cells (usually the PGR value) in the ER field. The 
RM cells traverse forward through the network, and the destination turns the RM cells around in the backward direction. 
Intermediate switches on the path notify the source of congestion by marking the EFCI bit in the data cells and the CI 

5 or Nl bit in the RM cell, and/or reducing the ER value in the RM cells. 

[0009] Upon receiving the RM cell in return, the source should adapt its ACR to the information carried in the RM 
cell. If the CI bit is not set, the source may linearly increase its ACR by a fixed increment (RIF*PCR), where the rate 
increase factor (RIF) is determined at call setup. This increase can reach the ER value in the.RM cell, but should never 
exceed the PCR. If the CI bit is set, the source must exponentially decrease its ACR by an amount greater than or 

10 equal to a proportion of its current ACR (RDF* ACR), where the rate decrease factor (RDF) is also determined at call 
setup. The factors RIF and RDF control the rate at which the sources increase and decrease, respectively. 
[0010] If the ACR is still greater than the returned ER, the source must further decrease its ACR to the returned ER 
value, although it should never be below the MCR. If the Nl bit is set, the source should observe the CI and ER fields 
in the RM cell, but it is not allowed to increase the ACR above its current value. 

is [0011] In EFCI marking binary feedback control schemes, the ATM switch sets the EFCI bit in the ATM cell header 
during congestion. As described above, end systems use this one-bit information contained in each cell to increase/ 
decrease their cell rates by an incremental amount. This basic mechanism imposes very small implementation burdens 
on switches and requires minimal processing. Several mechanisms exist for the switch to determine the onset of con- 
gestion, including the following EFCI schemes. 

20 [0012] Many first generation ATM switches implemented binary schemes based on a simple first-in-first-out (FIFO) 
queuing mechanism shared by all ABR connections. Under this basic EFCI scheme, a switch determines the onset of 
congestion at time interval f when the queue level q(t) in one of the switch's buffers exceeds some preset threshold 
q T , shown in Figure 2A: 

25 q(t) > q T => EFCI bit(data cell) = 1 . 

In some implementations of this scheme, the switch declares the onset of congestion if the instantaneous queue size 
exceeds a threshold q hight shown in Figure 2B, and detects a normal "uncongested" state when the queue size drops 

30 below another threshold q low , also shown in Figure 2B. 

[0013] This basic binary EFCI scheme, which detects nodal congestion by comparing the present queue size with 
a preset threshold, has several disadvantages. First, congestion detection is delayed by the amount of time required 
to build up the queue. Second, the resolution of congestion is also delayed by the amount of time required to drain the 
queue. Such delayed detection followed by the unnecessary extension of congestion periods significantly increases 

3$ the oscillation period as well as the oscillation magnitude of ABR traffic within the network, causing a large consumption 
of buffer resources. Because both the queue levels and the ABR source rates oscillate continually, large buffers are 
required to maximize network utilization. 

[0014] A second binary EFCI scheme monitors the rate of cells coming into the queue, R in Figure 3, relative to a 
threshold on a short interval basis. The threshold may be defined, for example, in terms of a targeted link utilization, 

40 e.g., 0.9C, where C is the link capacity. If R exceeds a predefined threshold, the switch declares congestion. In a simple 
implementation of this scheme, the switch declares congestion if the count of queue input cells exceeds in an 
interval of T int This scheme improves upon the first scheme by implementing congestion avoidance, i.e., by controlling 
the incoming rate, the switch avoids the onset of congestion rather than detecting congestion after it occurs. 
[0015] In Y. Zhao, S. Q. Li, and S. Sigarto, "An Improved EFCI Scheme with Early Congestion Detection," ATM Forum 

45 Contribution AF-TM 96-1084, February, 1 996, the authors proposed a variant of the rate-thresholding scheme called 
EFCI with Early Congestion Detection (EFCI-ECD). To improve network resource utilization, this scheme detects con- 
gestion by monitoring the low-pass filtered arrival rates. Recent multimedia queuing analyses (S. Q. Li and C. Hwang, 
"Queue Response to Input Correlation Functions: Continuous Spectral Analysis," IEEE/A CM Trans. Networking, Vol. 
1, No. 6, December, 1993, pp. 678-692; S. Q. Li, S. Chong, and C. Hwang, "Link Capacity Allocation and Network 

so Control by Filtered Input Rate in High Speed Networks," IEEE/ACM Trans. Networking, Vol 3, No. 1 , February, 1995, 
pp. 10-15) indicate that queue congestion is mainly captured by low-frequency traffic behavior. Filtering of the VBR 
traffic, for instance, eliminates the high-frequency fluctuations which are easily absorbed by buffering; ABR rates should 
not be adjusted based on these short-term VBR bursts. 

[0016] Operation of a rate-thresholding scheme that monitors low-pass filtered cell arrival rates can be described 
55 mathematically as follows. Let C denote the link capacity and p r denote a targeted utilization (e.g., 0.9) so that pjC 
specifies some preset rate threshold. Let r A {t) denote the total measured ABR traffic rate and let r^t) denote the total 
measured background or guaranteed traffic {i.e., CBR and VBR) rate at time interval t; let^f) andr a (r) represent the 
corresponding low-pass filtered rates. A switch implementing the rate thresholding scheme determines that congestion 
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is imminent at time interval t and sets the EFCI bit when the aggregate filtered rates exceed the rate threshold: 

r A (t) + r B (t) zpjC=> EFCI bit (data cell) = 1 

5 

By removing the high-frequency fluctuations, filtering improves stability and reduces oscillations. However, to ensure 
stability, the rate-thresholding scheme should be combined with the basic queue fill thresholding scheme, Le., the EFCI 
bit should also be set when a queue threshold is exceeded. Using low-pass filtered rates as a congestion indicator 
10 improves performance by detecting congestion changes more rapidly. Depending on the burstiness of the network 
traffic, p 7 may be tuned to values closer to 1 . 

[001 7] A larger value of p r decreases the safety margin of slack bandwidth that may be used to accommodate sudden 
bursts in traffic. Hence, network operators that operate their networks with a high p r run a higher risk of violating the 
service requirements of some connections. The appropriate utilization target of any given link depends on the charac- 
15 teristics of the traffic flowing through it. If each source's rate is small compared to link capacity (small grain size) and 
bursts are short, the link's utilization target can be set higher. Bursty sources with large, long bursts or long idle periods 
require a lower link utilization target 

[0018] In addition to detecting congestion early and reducing oscillations in ABR traffic, the low-pass filtered rate- 
thresholding scheme results in a substantially shorter time period of congestion than the basic queue-thresholding 
20 scheme. Because of improved congestion detection performance, the buffer capacity required at each switching node 
is substantially reduced. 

[0019] A third binary EFCI scheme, illustrated in Figure 4, monitors the change in queue size Q to detect the onset 
of congestion. In this scheme, the first difference of queue depth, AQ/At is used as the congestion indicator, allowing 
tight control of the queue size. 

25 [0020] The main advantage of existing binary feedback schemes is their implementation simplicity. The first-gener- 
ation ATM switches implement a simple FIFO queuing mechanism and mark the EFCI bit in the event of congestion. 
However, the proper tuning of the rate increase and decrease parameters (R!F and RDF) and the queue or rate thresh- 
old (q T or p T ) is required. The maximum queue length and the achieved utilization depend on the proper tuning of all 
the parameters. Specification of g T and p r yields vastly different buffer requirements and loads depending on the 

30 selection of RIF and RDF. 

[0021] The main disadvantage of the existing binary feedback schemes is that they are not fair in a multiple-node 
network. The schemes may cause some connections to have unfair access to the available bandwidth depending on 
the network topology and end system behavior. It has been observed that sources which have several hops on their 
routes are more likely to have an EFCI bit set to 1 than sources with few hops. As a result, it is unlikely that long-hop 

35 connections will be able to increase their rates, and consequently they will be beaten down by short-hop connections. 
This has been referred to as the beat-down problem (J. Bennet and G. T Des Jardins, "Comments on the July PRCA 
Rate Control Baseline, " ATM Forum Contribution AF-TM 94-0682, July, 1 994). 

[0022] In A. Arulambalam, X. Chen, and N. Ansari, "Allocating Fair Rates for Available Bit Rate Service in ATM 
Networks," IEEE Communication Magazine, November, 1996, pp. 92-100, the authors propose several alternatives 

40 for alleviating the unfairness problem in binary feedback schemes. The first alternative is to provide each connection 
or group of connections with separate queues. Such a per-connection queuing mechanism makes it possible to deter- 
mine the congestion status of a connection in isolation from the rest of the connections. Because only those connections 
that cause the congestion of their associated queues are subject to congestion marking, the beat-down problem in the 
binary scheme can be eliminated. Thus, this isolation ensures fair access to buffer space and bandwidth among all 

45 competing connections. 

[0023] This per-connection queuing approach also enjoys several other distinct benefits. First, per-connection queu- 
ing can be used to help protect individual connections effectively since a misbehaving connection can only cause its 
own queue to overflow. Second, it allows the delay and the loss behavior of individual connections to be decoupled 
from each other, leading to better QoS control. Finally, per-connection information is readily available to help the im- 

50 piementation of other types of congestion control, such as early packet discard (EPD) and partial packet discard (PPD) 
mechanisms. 

[0024] Although per-connection queuing offers considerable advantages over single FIFO queuing, this comes at 
the significant cost of switch design and implementation. Efficient buffer management and scheduling among the 
queues can be costly to design and implement. Such implementation may be prohibitive when the number of connec- 
55 tions grows substantially large. 

[0025] The second feasible alternative is to provide selective or intelligent marking on a common FIFO queuing 
mechanism when congestion takes place. In this alternative, a switch computes a fair share and a connection is asked 
to decrease its rate by marking the EFCI bit only if its current cell rate is above the computed fair share, in this approach, 
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cells belonging to virtual circuits are selectively marked based on their current cell rates which are carried in the RM 
cells, in addition to EFCI marking, CI and Nl bits in the RM cells can be used to regulate the rate at the source. 
[0026] Although per-connection queuing and intelligent marking provide good solutions for the beat-down problem, 
they do not address the other problems of binary feedback schemes. In particular, as has been described, binary 

5 feedback schemes are generally slow in response to congestion because it may take several round-trips to alleviate 
congestion or utilize any unused bandwidth. Another problem is that binary schemes may generate large oscillations 
even at steady-state, making it difficult to achieve very high link utilization. The best solution for improving stability and 
reducing oscillations in the ABR cell rates and queue levels in a binary scheme is to implement a low-pass filtered rate- 
thresholding scheme that fitters input traffic by removing the high-frequency fluctuations in the arrival process. 

10 [0027] The multimedia queuing analyses by Li et al. observe that in a finite-buffer system, the traffic arrival process 
is essentially characterized by the low-frequency components, the effective capacity at each node is essentially cap- 
tured by the low-frequency input traffic filtered at a properly selected cut-off frequency, and the high-frequency com- 
ponents can be absorbed by the finite buffer capacity at each node. However, many filtering techniques known or used 
today are generally not suitable for multimedia applications. For example, in Li et al. and S. Chong, S. Q. Li, and J. 

15 Ghosh, "Dynamic Bandwidth Allocation for Efficient Transport of Real-Time VBR Video over ATM," INFOCOM'94, 1994, 
pp. 81-90, the authors use a Finite Impulse Response (FIR) filter with a very sharp cut-off frequency. This filtering 
approach has certain limitations because the cut-off frequency has to be known for each traffic stream. Although this 
approach may be applicable to video servers where the filtered input rate can be precalculated and used at connection 
setup time, it is difficult to predetermine what the cut-off frequency should be for an aggregate traffic stream resulting 

20 from the multiplexing or concentration of different traffic streams. Furthermore, simple filtering at a certain cut-off fre- 
quency is not sufficient because there is no notion of intensity of the frequency involved. The approach adopted by 
Zhao et al., in which the filtering function is implemented by a simple short-term moving average operation, is also 
unsuitable for multimedia connections because the moving average filter cannot handle nonstationary signals or signals 
with significant transient components. 

25 [0028] Many traffic filtering mechanisms are designed in an ad hex: manner. Because the arrival and service rates 
in networks are time-varying, it is crucial to estimate network parameters by an adaptive procedure that takes changes 
into account. Furthermore, since the estimator must be implemented in real-time, its computational and space com- 
plexity should be as small as possible. Time weighted moving averages {e.g., exponentially weighted moving average 
(EWMA)) have these properties and are often used in practice. However, appropriate filter weights are difficult to de- 

30 termine and are usually chosen heuristically. 

[0029] It is desirable, therefore, to provide a binary feedback control scheme that monitors the queue input rate 
relative to a threshold. It is more desirable to provide such a scheme that operates on low-pass filtered traffic so that 
ABR rates are not modified based on fluctuations due to unfiltered VBR traffic. It is even more desirable to provide a 
filtering process that is simple and inexpensive to implement and is effective in filtering multimedia application traffic. 

35 Finally, it is desirable to provide a binary feedback control scheme that complies with existing standards. 

[0030] This invention seeks to provide an improved scheme and associated method by providing a flow control 
solution using a wavelet transform-based filtering mechanism. 

[0031] A system consistent with the present invention is for use in controlling congestion in a network with a network 
switch having a first buffer for receiving packets of a first type into a first queue and a second buffer for receiving packets 

40 of a second type into a second queue. The system includes a first measurement module coupled to the first buffer for 
measuring an arrival rate of the packets of the first type, a second measurement module coupled to the second buffer 
for measuring an arrival rate of the packets of the second type, and a flow control module coupled to the first and 
second measurement modules and the second buffer for determining a congestion state of the switch based on the 
arrival rates of the first and second type packets and the queue length of the second queue. The first measurement 

45 module includes a sampler for sampling the arrivals of the first type packets and a wavelet transform processor coupled 
to the sampler for determining an effective arrival rate of the first type packets in response to the sampled arrivals. 
[0032] Methods are also provided for carrying out the systems consistent with the present invention. 
[0033] Consistent with the present invention, the first type packets comprise CBR and VBR packets, and the second 
type packets comprise ABR packets. However, it should be appreciated by one skilled in the art that systems and 

50 methods consistent with the present invention have a potentially broader application to flow control in other packet 
networks. 

[0034] The advantages accruing to the present invention are numerous. Methods and systems consistent with the 
present invention provide an efficient and effective binary ABR flow control solution. The solution results in smaller 
oscillations in ABR rates, and therefore smaller buffer requirements in network switches. 
55 [0035] The above desires, other desires, features, and advantages of the present invention will be readily appreciated 
by one of ordinary skill in the art from the following detailed description of the preferred implementations when taken 
in connection with the accompanying drawings. 
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Brief Description of the Drawings 
[0036] 

5 Figure 1 is a diagram illustrating the basic operation of ABR congestion control; 

Figures 2A-B illustrate an EFCI congestion control scheme with simple queue fill thresholding; 

Figure 3 illustrates an EFCI congestion control and avoidance scheme with queue input rate thresholding; 

Figure 4 illustrates an EFCI congestion control scheme with queue growth rate thresholding; 

Figure 5 is a high-level block diagram of an ABR flow control system consistent with the present invention; 
10 Figure 6 illustrates an algorithm for calculating the forward fast wavelet transform; 

Figure 7 illustrates an algorithm for calculating the inverse fast wavelet transform; 

Figure 8 illustrates components of an ABR flow control system consistent with the present invention that includes 
a wavelet-based traffic filtering mechanism; 

Figure 9 is a flowchart of a method for calculating effective capacity consistent with the present invention; 
is Figure 10 illustrates an example of a method for calculating effective capacity consistent with the present invention; 

Figure 11 illustrates a timing diagram associated with a method for calculating effective capacity consistent with 
the present invention; 

Figures 1 2A-B illustrate event diagrams associated with a method for calculating effective capacity consistent with 
the present invention; and 
20 Figure 1 3 illustrates an ABR flow control system consistent with the present invention. 

Detailed Description of the Preferred Embodiments 

[0037] Figure 5 illustrates, at a high level, an ABR flow control system consistent with the present invention. The 
25 system, shown generally by reference numeral 1 50, operates within a network switch, such as an ATM switch. Incoming 
CBR/VBR cells arrive at the switch and enter buffer 152. Bandwidth measurement module 154 determines the effective 
bandwidth utilized by current CBR/VBR traffic based on information received about cell arrivals. Flow control module 
1 56 uses the effective CBR/VBR bandwidth and the effective ABR rate (C a ) and queue status (q a ) to detect congestion 
and outputs a decision related to flow control, e.g., the EFCI bit should be set to 1 because congestion has been 
30 detected. Consistent with the present invention, bandwidth measurement module 154 and flow control module 156 
may be implemented as software stored in a memory in the ATM switch and executed by a processor in the ATM 
switch. The memory may be any type of computer-readable medium, such as any electronic, magnetic, or optical read/ 
write storage device. 

[0038] As explained above, it is desirable to measure the bandwidth of incoming traffic in such a way that high- 
35 frequency components due to VBR traffic are filtered out. By extracting the underlying low-frequency traffic profile, 
bandwidth measurement module 154 provides flow control module 156 with a more stable effective bandwidth meas- 
urement on which to base a flow control decision. 

[0039] It is also desirable that the filtering mechanism be adaptive and computationally simple. Time weighted moving 
average schemes have these properties, but it is difficult to choose correct filtering weights for such schemes. There- 
to fore, an alternative signal analysis method is needed to effectively filter the high-frequency components. 

[0040] One example of a signal analysis tool is the Fourier Transform (FT), which has become a cornerstone of 
modem data analysis. The FT translates a function in the time domain into a function in the complex (real and imaginary) 
frequency domain, where it can be analyzed for its frequency content. The FT describes the original function in terms 
of orthogonal basis functions of sines and cosines of infinite duration. The Fourier coefficients of the transformed 
45 function represent the contribution of the sines and cosines at each frequency The FT is most commonly used in the 
form of the Discrete Fourier Transform (DFT), which analyzes discretely sampled time series. The DFT works under 
the assumption that the original time domain function is periodic in nature. As a result, the DFT has difficulty with 
functions that have transient components, that is, components which are localized in time. This is especially apparent 
when a signal has sharp transitions. Another problem is that the DFT of a signal does not convey any information 
50 pertaining to translation of the signal in time other than the phase of each Fourier coefficient. The phase values are 
averaged for the input function. Applications that use the DFT often work around the first problem by windowing the 
input data so that the sampled values converge to zero at the endpoints. Attempts to solve the second problem, such 
as the development of the short-time Fourier transform (STFT), have met with marginal success. 

55 Wavelet Functions 

[0041] In recent years, new families of orthogonal basis functions have been discovered that lead to transforms that 
overcome the problems of the DFT These basis functions are called wavelets, which, unlike the sine and cosine wave 
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of the FT, do not need to have infinite duration. They can be non-zero for only a small range of the wavelet function. 
This "compact support" allows the wavelet transform (WT) to transform a time domain function into a representation 
that is not only localized in frequency (like the FT) but in time as well. With the frequency content evolving with time 
for most real-time signals (including traffic in computer networks), time-frequency analysis using wavelets has become 
5 a more powerful tool. 

[0042] The fundamentals of wavelet functions will now be described to the extent they pertain to the present invention. 
Details can be found in S.G. Mallat, "A Theory for Multiresolution Signal Decomposition: The Wavelet Representation, " 
IEEE Tran. Pattern Analysis and Mach. Intelligence, Vol. II, No. 7, July, 1 989, pp. 674-693, and O. Rioul and M. Vetterli, 
■Wavelets and Signal Processing," IEEE SP Magazine, October, 1 991 , pp. 1 4-38, both of which are incorporated herein 
10 by reference. 

[0043] There are two fundamental equations upon which wavelet calculations are based. These are the scaling 
function (also called the dilation equation or fundamental recursion) and the primary wavelet function (mother wavelet): 



♦W - £ «vK2'-A); i|/(0 = £ (-i*W,4K2/+fl , (i) 



20 where Z is the set of integers and the are the wavelet coefficients. Both of these functions are two-scale difference 
equations and are the prototypes of a class of orthonormal basis functions of the form: 



2s = ^ 7. * € Z i|» /t (0 = ^ j, k e Z. (2) 

[0044] The parameter /controls the dilation or compression of the function in time scale as well as in amplitude. The 
parameter k controls the translation of the function in time. The set of basis functions formed by <(>(t) and \j/(t) is a system 

30 of scaled and translated wavelets. 

[0045] Wavelet systems can be either real or complex-valued, though most research has used real-valued wavelet 
systems. Wavelet systems may or may not have compact support. Wavelets have compact support if and only if they 
have a finite number of non-zero coefficients. Since compact support is what gives wavelets the ability to localize in 
both time and frequency, only wavelets of that type will be described. 

35 [0046] Several techniques have been used to create wavelet systems. These include cubic splines, complex expo- 
nentials, and parameter space constructions. Parameter space constructions have been used to construct wavelets 
which are orthonormal bases. The Haar wavelet (the first wavelet discovered) and the Daubechies wavelets belong 
to this class. Orthonormal wavelets allow perfect reconstruction of a function from its wavelet transform coefficients by 
the inverse WT. The following set of conditions must be satisfied before a set of coefficients can represent an orthon- 

40 ormal wavelet: 



45 HZ HZ 

X>***-2/ = °'/° r Ev* = 1 (3) 

HZ HZ 

50 

where a~ k is the complex conjugate of a k 

[0047] Once a wavelet system is created, it can be used to expand a function g(t) in terms of the basis functions: 
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with the coefficients calculated by the inner product as: 

c i =(Wt)\g) = igm i (t)dt (5) 

s 

d l.k = bjjfiW = l9m Jtk (t)dt. (6) 

If the wavelet system has compact support and an upper limit J is placed upon the degree of dilation/, then the expansion 
10 equation becomes: 

15 uz yi keZ 

[0048] The expansion coefficients c, represent the approximation of the original signal g(t) with a resolution of one 
point per every 21 points of the original signal. The expansion coefficients d Jk represent the details of the original signal 
at different levels of resolution. These coefficients completely and uniquely describe the original signal and can be 
20 used in ways similar to the Fourier transform. The Wavelet Transform, then, is the process of determining the values 
of c, and dj k for a given g(t) and wavelet system. 

[0049] The expansion equation naturally leads to a recursive algorithm for the wavelet transform if some assumptions 
are made. First, the function g(t) is taken as a sequence of discrete points, y, sampled at 2 m points per unit interval. 
These points can be viewed as the inner product of $ and g{f). That is, the sample points are an approximation, or c, 

25 coefficients, of the continuous function g((). This allows the c, and d jk terms to be calculated by convolution of the 
samples of g(t) with the wavelet coefficients a k Daubechies has discovered that the WT can be implemented with a 
specially designed pair of finite impulse response (FIR) filters called a quadrature mirror filter (QMF) pair. A FIR filter 
performs the dot product (or sum of products) between the filter coefficients and the discrete data samples. The act of 
passing a set of discrete samples, representing a signal, through a FIR filter is a discrete convolution of the signal with 

30 the coefficients of the filter. 

[0050] The outputs of the QMF filter pair are decimated (or down-sampled) by a factor of two, that is, every other 
output sample of the filter is kept, the others being discarded. The low-frequency (low-pass) filter output is fed into 
another identical QMF filter pair. As illustrated in figure 6, this operation can be repeated recursively as a tree or pyramid 
algorithm, yielding a group of signals that divide the spectrum of the original signal into octave bands with successively 

55 coarser measurements in time as the width of each spectral band narrows and decreases with frequency. 

[0051] Mallat has shown that the tree or pyramid algorithm can be applied to the wavelet transform by using the 
wavelet coefficients as the filter coefficients of the QMF filter pairs. The same wavelet coefficients are used in both the 
low-pass and high-pass filters. The low-pass filter coefficients are associated with the a^of the scaling function The 
output of each low-pass filter is c h or approximation components, of the original signal for that level of the tree. The 

40 high-pass filter is associated with the a k of the wavelet function The output of each high-pass filter is the d jk> or 
detail components, of the original signal at resolution 21. The c,of the previous level are used to generate the new c, 
and d Jk tor the next level of the tree. The decimation by two corresponds to the multi-resolution nature (the / parameter) 
of the scaling and wavelet functions. 

[0052] The inverse DWT essentially performs the operations associated with the DWT in the opposite direction. The 
45 inverse transform coefficients are combined to reconstruct the original signal. The same a k coefficients are used in the 
forward transform, except the ordering of the coefficients is reversed. As illustrated in Figure 7, the process works up 
the branches of the tree combining the approximation and detail signals into approximation signals with higher levels 
of detail. Instead of decimation, the signals are interpolated, that is, zeroes are placed between each approximation 
and detail sample and the new signals are passed through the low-pass and high-pass filters, respectively. The inter- 
50 mediate zero values are replaced by "estimates" derived from the convolutions. The outputs of the filters are then 
summed to form the approximation coefficients for the next higher level of resolution. The final set of approximation 
coefficients at the top of the tree in the reverse transform is a reconstruction of the original signal data points. 
[0053] An added advantage of the DWT is that it is more computationally efficient than the FFT An FFT of length N 
(where N is an integral power of 2) requires 0(A/log 2 (A/)) operations. A DWT of length N requires only O(N) operations. 
55 The FIR filters are linear processing elements. Only half as many operations are performed on each level of resolution 
in the tree algorithm as in the previous level. 

[0054] Because data traffic in computer networks is generally nonstationary (i.e., varying with time) and possesses 
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10 



15 



some form of statistical setf-similar (also known as fractal) behavior, analysis using wavelet transforms is well-suited 
to such traffic. Self-similarity in networking traffic manifests itself in burstiness across a wide spectrum of time scales. 
Self-similar traffic models have been created that can capture correlation structures observed in networking traffic. On 
the other hand, the mathematical analyses of these models are often extremely difficult. The wavelet transform enables 
analysis of the traffic without assumptions with respect to either the stationarity of the data or the statistical distribution 
embodied in the data. These features are retained because the WT is a linear operator that preserves the time local- 
ization of the data in the transform coefficients at the different levels of scale. 

Discrete Wavelet Transform (DWT) Algorithms 

[0055] Methods for signal decomposition and signal reconstruction using discrete wavelet transforms to be used in 
filtering methods consistent with the present invention will now be presented/The DWT may be computed recursively 
as a series of convolutions and decimations. At each scale level /an input sequence Si' 1 (n) is fed into a low-pass and 
high-pass filter, H(z) and G(z), respectively. The output from the high-pass filter G(z) represents the detail information 
in the original signal at the given scale j, denoted by V\P(n). The output of the low pass filter H(z) represents the remaining 
(coarse) information in the original signal, and is denoted by Si(n). H(z) and G(z) are the z-transforms of the set of 
coefficients representing the scaling function <|>(r) and the wavelet function y(t), respectively. 
[0056] The sequential DWT algorithm may be expressed as 



20 



S J (n) = //(2n-*), 

k 



(8) 



25 



W\n) = J^S j -\k)G(2n-k). 

k 



(9) 



30 



35 



40 



At the output of every level there is a downsampling operation which appears as the factor two in the argument of the 
filter coefficients. 

[0057] One problem associated with the DWT of finite duration signals occurs at the boundaries of the input sequence 
and is referred to as the edge effects problem. This problem occurs in any transformation such as a DWT that uses 
convolution on finite length signals. Edge effects are a consequence of the output of an FIR filter being L-1 samples 
longer than the input sequence. Although several edge effects solutions have been proposed, such as reflecting the 
end points of the input or periodically extending them, solutions tend to be application-specific. I n order to clearly specify 
the behavior of the algorithm at the edges of the index space, the following assumption is made. The first M/2 outputs 
from the filters, at scale /, are valid DWT coefficients, and U2A outputs are discarded. This assumption is by no means 
limiting as far as the architecture is concerned. It is always possible to derive different arrays with similar characteristics 
given any other solution to the edge effects problem. 

[0058] The following pseudo-code represents the DWT algorithm for signal decomposition in nested loop form, with 
adjusted limits accounting for edge effects. The input is X(n) = S° (n) , and Si (n) and Wi(n) correspond to the detail 
and coarse sequences shown in equations (8) and (9). The index space of the algorithm is 



45 



50 



55 



/ = 



l<n< MIV 
(n,k,j) | 1< k< 2niorn< L/2 

2n- L+\<k<2nforn> L/2 
l<j<J t 



(10) 



Without loss of generality, the harmless assumptions are made that Mand L are powers of 2 and that all indices start 
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from 1. M=2 m and J are the length of the original signal vector and the maximum number of levels for signal transfor- 
mation, respectively. 

procedure SIGNAL_DECOMP [X(n) t H t G t M. J] 
X(n) = input signal vector 

H = low-pass wavelet filter coefficients for signal decomposition 
G = high-pass wavelet filter coefficients for signal decomposition 

1. 8>(n)=X(n),S(n) = 0 t Vj,Vn 

2. fory=ltoy 

3. for/i = ltoM2' 

4. for k = max (1, In - L +1) to In 

5. 8(n)=8(n) + 8 A (k)H(2n-k) 

6. W(n) = W(n) + S JA (k) G(2n - k) 

7. endfor(£) 

8. endfor(n) 

9. endfor(/) 

10. output = 8 (n) t W (n)J = 1, 2, J 



[0059] Similarly, the inverse DWT can be used for signal reconstruction. In methods consistent with the present 
invention, as will be detailed below, only partial reconstruction of the signal is required to determine the high-frequency 
("noise") component of the signal. Therefore, the following pseudo-code represents an algorithm for reconstruction of 
only the high-frequency component. 
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procedure PARTIAL_RECONS [W(n), j = l f 2, jca/e, HG, scale] 

(This procedure computes the high-frequency ("noise") component at level 
scale. The reconstruction is done from level scale to level 1, where 
scale i 1.) 

A 

//= low-pass wavelet filter coefficients for signal reconstruction 

* A 

G= high-pass wavelet filter coefficients for signal reconstruction 
W (n),j = 1, 2, scale = output of high-pass filter. (W 1 (/?) is the highest 
is frequency sub-band) 

M = 2 m = length of original signal 
1 . for j = scale to 1 
20 2. for /i =1 to M/2° { > 

S' J l (ri) = 0 



25 



30 



35 



40 



45 



for k = max (1, 



1) to 



if (/ < .ycafe) 

sp i («) = s* 1 r«j + s* y ft> #fa-2 ft-u; 

endfor(A) 
endfor(Ai) 
endfor(/) 

4. output = S°° (n), n, = 1, 2, M 



Bandwidth Measurement 



[0060] Referring again to Figure 5, a flow control system consistent with the present invention includes bandwidth 
so measurement module 154 and flow control module 156. As explained, it is desirable that bandwidth measurement 
module 154 generate an effective capacity of CBR and VBR traffic that has had high-frequency components filtered 
out in order to provide a stable input to flow control module 1 56. For the reasons described, traffic filtering mechanism 
based on wavelet analysis is effective for network traffic. 

[0061] Figure 8 illustrates some components of bandwidth measurement module 154 consistent with the present 
55 invention, including a wavelet-based traffic filtering mechanism. This architecture is related to the wavelet-based con- 
nection admission control mechanism developed by R Droz in "Traffic Estimation and Resource Allocation in ATM 
Networks," Diss ETH Zurich, No. 11462, 1996 (found at http://www.zurich.ibm.eom/-.dro/) ("Droz I"), and "Estimation 
and Resource Allocation Based on Periodical Wavelet Analysis," 4th UK/Australian International Symposium on DSP 
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for Communication Systems, September, 1996 ("Droz II"), which are both incorporated herein by reference. 
[0062] Sampler 1 60 generates periodic measurements of the incoming CBR/VBR traffic arriving at buffer 1 66. These 
measurements consist of cell counts divided byAf, where At is the sampling interval. The sampling frequency depends 
on the link speed and the buffer size. Sampler 160 feeds its output into wavelet transform-based processor 164. In 
5 periodic intervals, processor 164 analyzes the samples and calculates the effective capacity. Buffer manager 162 
checks the queue length in buffer 166 and signals processor 164 if the queue length exceeds a certain threshold or if 
cell loss occurs. Buffer manager 162 also signals processor 162 if a certain period of time passes without cell loss. 
Processor 162 responds to these signals by adjusting the traffic filtering process. 

10 Effective Capacity 

[0063] Consistent with the present invention, processor 164 implements a method for calculating the measurement- 
based effective capacity using wavelet-based filtering. A method consistent with the present invention contains mod- 
ifications and enhancements to the algorithm given in Droz I. In general, sampled traffic is separated into signal and 
15 noise, i.e., a low-frequency component and a high-frequency component. By subtracting the high-frequency component 
from the sampled traffic measurements, the method obtains the filtered (low-frequency) component of the original 
signal. The new effective capacity is the maximum of the filtered signal during the previous measured interval. 
[0064] A method for calculating effective capacity consistent with the present invention uses the following parameters: 

20 A{f ), f = 0, 1 , 2, 3, Equidistant traffic measurements (cell counts) 

n= 0, 1, 2 Window number 

L n (f): Low-frequencies in window n 

H n (f ): High-frequencies in window n 

C n (J ): Wavelet coefficients (window n) 
25 & n (J ): Modified wavelet coefficients (window n) 

w=2 nh . Window size 

c = 2* k>0: Calculation rate 

/= 0, 1, 2 nr. Iteration (search) level 

a Buffer size (in cells) 
30 £ Allowed buffer utilization, 0<£< 1 

A(w+nc-"\y Right border of window n 

A(nc): Left border of window n 

[0065] The original traffic measurement curve of a window is split into a low-frequency component L and a high- 
35 frequency component H according to 



40 



A(s) =L n (s) 
nczs<w + nc 



01) 



where 



45 



max 
nc <; s < w + nc 



w * nc - 1 
V i ~ s 



(12) 



SO 



55 



is the break criterion between L and H. That is, to separate the high-frequency component Hfrom the low-frequency 
component L, the method searches for the maximum Hthat fulfills equation (12). For window n at iteration level /, the 
left hand side of equation ( 1 2) is determined by executing the following search criterion on the vector of high-frequency 
components H rt y[/], /= 1,2,. ...2 m , where max_dev is the maximum deviation of noise from zero and is determined by 
procedure new_cumsum [H r ^(2 m )], illustrated by the following pseudo-code: 
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procedure new_cumsum [x(M)] 

(x is a vector with M elements and as the first element) 



Noise = 0 
max_dev=0 
for i = 1 to M 

if(Noise + x(/)<0) 
Noise = 0 

else { 
Noise = Noise + x(f) 
max_dev = max(max_dev, Noise) 

} 

endfor 

output max_dev 



In the above procedure, the accumulant is never set below zero because in reality the available capacity in the past 
cannot be accumulated for use in the future. This is a modification of the cumsum(x) function provided in Droz I and 
Droz II, which is defined as the cumulative sum of the elements of x. That is, the Ah element of cumsum(x) is 



where Xj is the yth element of x. cumsum(x) carries over negative values in its computation, while new_cumsum(x) does 

not. 

[0066] The method searches iteratively for the strongest noise that still fulfills equation (1 2) using wavelet transforms. 
First, the wavelet transform W[] is executed to obtain the wavelet coefficients up to m levels of decomposition for 
measurement window n: 



Next, the method begins an iterative process using the inverse wavelet transform W [] , with/= 0, 1,2,..., m denoting 
the iteration level: 




C.M 



= W[A(s)] 



(13) 



nc £ s < w + nc 




(14) 



nc s s < w + nc 



with 
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C -jW " [C„(i), otherwise K J 

5 

[0067] After each back-transformation, the maximum criterion in equation (12) is checked to find the most intensive 
but still acceptable noise. In each iteration one more scale of coefficients is counted to the high-frequency component, 
whereas the rest of the coefficients are set to zero. Thus, the method searches for the largest /that still fulfills the 
maximum criterion. The two extreme cases are £0 and ^m. For >0, all traffic is counted as low-frequency, whereas, 
10 for /=m, everything is counted as high-frequency. The estimate of the effective capacity C e is then the maximum of the 
low-frequency component during the window: 



max L m (s), (16) 

15 nc <s s<w + nc 



which can be used for control purposes. If the wavelet processor uses cell counts directly, the effective capacity (in 
cells/second) is given by the output of the processor (in cell counts) divided by the sampling interval At (in seconds). 
20 [0068] Figure 9 illustrates a flowchart of a method consistent with the present invention for calculating the effective 
capacity using wavelet transforms. The method uses the following parameters: 

coeffs: vector of wavelet coefficients 

highj: vector of high frequencies 
25 lowjt. vector of low frequencies 

X: vector of traffic measurements 

h: tow-pass wavelet coefficients (for signal decomposition) 

g high-pass wavelet coefficients (for signal decomposition) 

scale: current level of wavelet decomposition 
30 nr_of_scales: number of levels of wavelet decomposition 

rh: low-pass wavelet filter coefficients (for signal reconstruction) 

rg: high-pass wavelet filter coefficients (for signal reconstruction) 

maxjdev: maximum deviation of noise from zero 

reset: set coarse and detail coefficients of level sca/e+1 and lower all to zero 
35 found; TRUE if maximum high-frequency component satisfying equation (12) is found, FALSE otherwise 

[0069] After initializing foundXo FALSE (step 200), scale to 0 (step 202), and high_f_old to 0 (step 204), the wavelet 
transformation is executed to obtain the vector of wavelet coefficients coeffs. The inputs to the wavelet transformation 
are the traffic measurements (X), the wavelet low- and high-pass filter coefficients (h and g), and the number of levels 

40 of decomposition (nr_oLscales). 

[0070] The signal/noise separation is done in the loop (steps 208-224). In step 208, if equation (12) is not satisfied 
(founcfcrrFALSE) and the level of decomposition is less than the total levels of decomposition, flow proceeds to step 
210, which increases the level by one. Flow proceeds to step 212, which sets new_coeffs to the coefficients obtained 
by the wavelet transform, and then to step 214, which resets the lower level coefficients to zero. In step 216, the noise 

45 is obtained by an inverse wavelet transform using the modified coefficients. Thus, in each iteration one more level 
becomes part of the vector of high frequencies and all the lower levels are set to zero. Flow proceeds to step 218, in 
which a cumulative sum function is applied to the noise, yielding a sequence of values, the maximum of which is the 
maximum deviation from zero. This value represents the maximum filling of the buffer during the observed interval. In 
step 220, the method determines whether the maximum deviation from zero has surpassed the product of the buffer 

so utilization and buffer size, i.e., %B. If max__devha$ surpassed found is set to TRUE (step 224), flow returns to step 
208, and then to steps 226-228 to calculate the effective capacity. If max_d0v^has not surpassed %B, high_Lofd'\s set 
to highj and flow returns to step 208. In step 208, if there are more levels of decomposition, flow proceeds to steps 
210-220 to continue searching for the highest noise meeting the break criterion. The break criterion is such that the 
maximum filling of the buffer is below the buffer size multiplied by the buffer utilization factor. If, on the other hand, the 

55 inverse wavelet transform has included every level of wavelet decomposition in its calculation, flow proceeds to steps 
226-228 to determine the effective capacity. 

[0071] In step 226, the low-frequency vector is computed as the difference between the vector of traffic measurements 
and the old high-frequency vector. For the case in which step 224 was reached because the maximum deviation had 
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surpassed £8, the old high-frequency vector is the last one such that Z>B was not surpassed. For the case in which 
step 224 was never reached because all levels of decomposition were analyzed without surpassing £8, the old high- 
frequency vector is set to the current high-frequency vector in step 222. In step 228, the effective capacity c is computed 
as the maximum value in the low-frequency vector. 

[0072] That is, the method obtains the filtered (low-frequency) component of the original signal by subtracting the 
noise (high-frequency) component from the traffic measurements. The effective capacity for the next period is then 
taken as the maximum of the filtered signal during the previously measured interval. 

[0073] The following pseudo-code represents the computation of the effective capacity C e illustrated by the flowchart 
of Figure 9. The procedure calls the signal decomposition and partial signal reconstruction procedures described earlier 

procedure EFFECTIVE_CAP [X(n), M f H, G, H t G, I buffer _size, 
buffer ^utilization] 

1. found = FALSE 
scale = 0 

7= number of scales 

Input signal vector, X(n), n=l,2, M, M - 2 m , Js m> m is any positive 

integer 

2. [S* (n). W (n)J=\X J] = procedure SIGNAL JDECOMP [X(n)> H t G, 
M, J] 

3. while (-found && scale <J){ 

scale -scale + 1 

S* (n) = procedure PARTI AL_RECONS [W (n),j=l,2, scale, 

H t G, scale) 

maxjiev = procedure new_cumsum[S*°(/7)] 
if (max_dev > buffer_size*buffer_utilization) { 

found = TRUE 

scale = scale - I 

} 
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4. 



} endwhile 

EfFective_capacityi C, = 0 
if (scale =0) { 



for n = 1 to M 



C e = msK(C e X(n)) 



endfor(n) 



} else 



{ 



\n) = procedure PARTIAL_RECONS [W (n),j=l,2, scale, 



A A 



H,G, scale] 



for n = 1 to M 



C e = max (C 0 X(n) - ST° (n)) 
endfor(«) 

} 

5. Convert C e to Mbps. If X(n) is given in cell counts, then 

C e in Mbps = (C e in cell counts* 53 *8)/sampling interval At in seconds) 



[0074] Figure 10 illustrates an example implementation of the signal/noise separation for the calculation of the ef- 
fective capacity detailed in Figure 9 using a 3-level wavelet decomposition. Steps indicated in Figure 10 correspond 
to those in Figure 9, and not all steps are shown in Figure 10 for the sake of clarity. First, a wavelet transform is 
performed on the input vector X (step 206), yielding the coarse wavelet coefficient c3 and the detail coefficients d1 , 
d2, and d3. Next, scale is set to 1 (step 210), and the reset function sets all coefficients below level 1 (i.e., all except 
d1) to zero (step 214). An inverse wavelet transform is performed to yield a first vector of high frequencies, hf1 (step 
216). If the maximum deviation of noise from zero is greater than the product of the buffer utilization factor and the 
buffer size, the effective capacity is the maximum value of the input vector X (step 228). The high-frequency vector is 
not subtracted because the search criterion was surpassed by even the first level of decomposition. If the maximum 
deviation of noise from zero is not greater than the product of the buffer utilization factor and the buffer size, processing 
continues at the beginning of the loop. Scale is increased to 2 (step 210), and the reset function sets all coefficients 
below level 2 to zero (step 214). Again, an inverse wavelet transform is performed to yield a second vector of high 
frequencies, hf2 (step 216). If the maximum deviation of noise from zero is greater than the product of the buffer 
utilization factor and the buffer size, the effective capacity is the maximum value of the input vector X minus the first 
high-frequency vector, hf1 (step 228). If the maximum deviation of noise from zero is not greater than the product of 
the buffer utilization factor and the buffer size, processing continues at the beginning of the loop. Scale is increased 
to 3 (step 210), and the reset function sets all coefficients below level 3 (only the coarse coefficient) to zero (step 214). 
Again, an inverse wavelet transform is performed to yield a third vector of high frequencies, hf3 (step 216). If the 
maximum deviation of noise from zero is greater than the product of the buffer utilization factor and the buffer size, the 
effective capacity is the maximum value of the input vector X minus the second high-frequency vector, hf2 (step 228). 
If the maximum deviation of noise from zero is not greater than the product of the buffer utilization factor and the buffer 
size, the effective capacity is the maximum value of the input vector Xminus the third high-frequency vector, hf3 (step 
228). 

[0075] An advantage in using the wavelet filtering technique described is the ability to include system parameters 
such as buffer utilization targets in the filtering process, which is not possible with the moving average or exponentially 
weighted moving average techniques. One guideline for selecting %B is so that the target buffer utilization level does 
not exceed a maximum permissible delay. That is, ^B^d,^ where c/ max is a maximum permissible delay. For example, 
£Bcan be chosen such that £fl<3ms. 

[0076] With different wavelet bases, it is possible to influence the speed of the noise detection and separation from 
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the signal. Droz I suggests that the above algorithm can be improved by performing the wavelet transformation analysis 
incrementally in the loop. Pipelined implementation of the transformation is also possible, as well as parallel imple- 
mentations. The most efficient way, however, is to interpret the coefficients directly on the different scales. Such an 
interpretation is strongly influenced by the chosen wavelet basis. 

5 

Sampling of Cell Arrivals 

[0077] Referring again to Figure 8, bandwidth measurement module 1 54 includes sampler 160 for generating periodic 
measurements of the incoming CBR/VBR traffic. These measurements consist of cell counts divided by Af, where At 
10 is the sampling interval. The sampling frequency depends on the link speed and the buffer size. Sampler 160 feeds 
its output into wavelet transform -based processor 164. In periodic intervals, processor 164 analyzes the samples and 
calculates the effective capacity, as detailed above. A sampling method consistent with the present invention is based 
on one given in Droz I. 

[0078] The highest sampling frequency is given by the maximum link speed (or output of the switch); on a 155Mbps 
is link, a cell can arrive every 2.735 microseconds at full load. Sampling at maximum speed can yield only two different 
measurement values: either a cell has been transmitted in the slot or it has not. Thus, when sampling at maximum 
speed it is not possible to observe any mean value, only noise. To perform a signal/noise separation where the noise 
is absorbed in the buffer, the sampling frequency must be high enough to capture a potential buffer filling between two 
sampling points. Therefore, the maximum number of cells that arrive in the time interval between the two measurement 
20 points must be below the buffer capacity. On the other hand, it is desirable to obtain more detailed information about 
the cell arrival process. Therefore, the sampling rate should be a fraction of the buffer size to capture a potential buffer 
filling between two sampling points. 

[0079] The sampling interval At must be chosen sufficiently large to obtain an accurate rate measurement and allow 
implementation; too large an interval, however, will lead to sluggish response. The measured rate is a multiple of the 

25 rate resolution 1/Ar, the interval At must be selected so that measurement of a solitary cell will not result in the rate 
being large compared to the link capacity. As a rough guideline, Af should be sufficiently large to produce a rate res- 
olution of 0.5% of the target rate, pjC, for instance. Thus, At should be chosen to be approximately ViO.OOSpjC). 
[0080] For a 155Mbps link and a 1000-^ell buffer, a sample once about every millisecond is a good value; it equals 
a maximum of 365 cells ( . l$5Mfrre*10' — _), which is about 1/3 of a frequently used buffer size in the range of 1 000 

30 cells. Higher sampling frequencies are^polssiDle but they increase the number of samples in the window. 

[0081] As illustrated in Figure 11, the effective capacity is continuously calculated using wavelet transforms applied 
on partially overlapping windows. The overlap is needed to capture trends across the borders of conservative windows, 
and the amount of overlap can depend on the extent of adaptability expected in the measuring process. Of course, 
the specific settings that are appropriate in any deployment environment depend on the characteristics of that envi- 

35 ronment {i.e., the traffic characteristics and link bandwidth), on how much ABR traffic a network operator wishes to 
admit on a link, and on the level of risk the network operator is willing to take that the service requirements will occa- 
sionally be violated. 

Buffer Manager 

40 

[0082] Referring again to Figure 8, bandwidth measurement module 1 54 includes buffer manager 1 62 for monitoring 
the queue in buffer 166 and signaling wavelet transform processor 164 accordingly, which responds to these signals 
by adjusting the traffic filtering process. For example, if the queue length in buffer 166 exceeds a threshold or if cell 
loss occurs, buffer manager 1 62 reduces the buffer utilization factor, and passes the new factor to wavelet transform 

45 processor 1 64. This makes flow control schemes consistent with the present invention more conservative in the sense 
that less filtering of high-frequency components is done. Typically, the current buffer utilization factor might be multiplied 
by a factor of 0.8. After a certain period of time has passed without cells being lost, buffer manager 162 increases the 
buffer utilization factor and passes this value to wavelet processor 164, causing more filtering of high-frequency com- 
ponents. A typical multiplication factor is 1 .1 , which increases the link utilization by 10%. In general, the reduction factor 

50 should be more drastic than the increase factor to avoid strong oscillations between cell loss and no-loss periods. 
[0083] In high-speed networks such as ATM networks, a cell loss rate of 10 -8 or smaller is frequently requested. 
Therefore, before buffer utilization can be increased, at least 10 8 cells must have been transmitted without losses. A 
higher value of approximately 10 10 cells can be used to decrease the chances of reverting back to a loss period. Thus, 
on a 155Mbps link under full load, a period of 7.5 hours must pass before buffer utilization can be increased. For this 

55 reason it is preferable to engineer and fix the utilization on links with speeds below 2 Gbps. Only on high-speed links 
should buffer manager 162 signal wavelet transform processor 1 64 to adjust its filtering based directly on the cell loss 
rate because only on a high-speed link can this parameter be measured with good confidence in a reasonable time. 
On a 2.4Gbps link at full load, for example, 10 10 cells are transmitted in a period less than 30 minutes, so the cell loss 
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rate can be measured fairly accurately. The higher the link speed, the faster the cell loss rate can be measured accu- 
rately. 

Bandwidth Measurement Event Diagrams 

5 

[0084] Figures 1 2A-B illustrate event diagrams for the wavelet-based traffic filtering performed by processor 1 64 in 
response to the traffic sampled by sampler 160. Af is the sampling interval and Ac represents the computation time for 
obtaining the effective capacity In Figure 12A, Ar is greater than or equal to Ac. A first window begins at nc, and a 
second window begins at (n+1)c. The minimum possible difference between the start of the first and second window 
10 is Ar because each window must start at the beginning of a sampling interval, and the effective capacity must be 
obtained for the first window before the end of the second window. On the other hand, in Figure 1 2B, At is less than 
Ac. Here, the second window cannot begin Af after the first window because the effective capacity for the first window 
has not yet been obtained. Rather, the minimum shift between the first and second windows is 9Af, where 9 = r Ac/ 
AD, i.e., the next integer greater than or equal to Ac/Af. 

15 

Flow Control 

[0085] Figure 1 3 illustrates an ABR flow control system consistent with the present invention and shows the system 
illustrated in Figures 5 and 8 in more detail. Cell arrivals are divided into CBRA/BR traffic and ABR traffic, each of which 

20 enters a queue in its own set of buffers, 1 66 and 1 70, respectively. The portions of the bandwidth measurement module 
pertaining to CBR/VBR traffic are sampler 160, buffer manager 162, and wavelet transform processor 164, as were 
also shown in Figure 8. The output of wavelet transform processor 164 is the effective capacity, C 0 , of CBRA/BR cell 
arrivals, i.e., the low-pass filtered bandwidth obtained as described above. ABR arrival rates are also measured by 
short-term rate measurement module 172, whose output is the ABR rate, C a . In a system consistent with the present 

25 invention, measurement module 172 can measure ABR arrivals using simple short-term instantaneous rate measure- 
ments. Alternatively, measurement module 172 may implement an effective capacity estimator similar to the one used 
to measure CBRA/BR arrivals. 

[0086] Flow control module 1 56 uses the CBR/VBR effective capacity, the ABR capacity, C a and the ABR queue 
length to detect early congestion and outputs a decision related to flow control, e.g., the state of congestion of the 
30 switch. In a method consistent with the present invention, flow control module 156 detects congestion when the total 
traffic rate C 0 + C a exceeds a threshold, e.g., a percentage of the available capacity on the outgoing link or virtual path, 
or when a queue threshold is exceeded. For example, congestion is detected when 



35 C e +C g >r\Corq>q t (17) 

where t) is a congestion detection thresholding factor (e.g., tj = 0.9) and C is the link or virtual path capacity. Upon 
detecting congestion, flow control module 1 56 indicates congestion by, e.g., setting the EFCI bit to 1 in headers of data 
cells transmitted downstream. 

40 [0087] The following pseudo-code illustrates the operation of an EFCI flow control mechanism consistent with the 
present invention for a single FIFO ABR queue: 



45 



SO 



55 



18 



EP0 999 718 A1 



procedure EFCI_CONTROL (EFCI Scheme with Rate Threshold Congestion 
Detection) 

1 . {Initialization} 
Initialize rate threshold factor (e.g., T] = 0.9); 
Initialize ABR traffic queue threshold, queue ^THRESHOLD; 

2. {At every cycle (window size) = w)} 

Update rate threshold factor t] and queue THRESHOLD, if necessary; 
is 3 . Receive a data cell from upstream node of virtual channel; 

if (C e + C a 2 rfQ or (queue ^LENGTH * queue ^THRESHOLD) 



10 



20 



25 



40 



KI ; CI : I ; 

Send data cell to the downstream node. 



[0088] It will be appreciated by those skilled in this art that various modifications and variations can be made to the 
flow control mechanism consistent with the present invention described herein without departing from the spirit and 
scope of the invention. Other embodiments of the invention will be apparent to those skilled in this art from consideration 
of the specification and practice of the invention disclosed herein. 

30 [0089] A flow control system and methodology controls and avoids congestion in an Asynchronous Transfer Mode 
(ATM) network. The congestion state of a network switch is determined based on incoming Available Bit Rate (ABR) 
traffic, Constant Bit Rate (CBR) traffic, and Variable Bit Rate (VBR) traffic. High-frequency fluctuations due to VBR 
traffic are filtered out of the CBR/VBR traffic to determine the underlying trend of network traffic. A filtering mechanism 
is based on wavelet transforms. The switch congestion state is determined based on the current ABR traffic, the filtered 

35 CBR/VBR traffic, and the current status of buffers in the switch. The congestion state can then be used to adjust the 
ABR rate according to a feedback scheme such as Explicit Forward Congestion Indication (EFCI). 



Claims 



1 . A system for use in controlling congestion in a network, the network including a network switch having a first buffer 
for receiving packets of a first type into a first queue and a second buffer for receiving packets of a second type 
into a second queue having a queue length, the system comprising: 

45 a first measurement module coupled to the first buffer for measuring an arrival rate of the packets of the first 

type, including 

a sampler for sampling the arrivals of the first type packets; and 

a wavelet transform processor coupled to the sampler for determining an effective arrival rate of the first 
so type packets in response to the sampled arrivals; 

a second measurement module coupled to the second buffer for measuring an arrival rate of the packets of 
the second type; and 

a flow control module, coupled to the first and second measurement modules and the second buffer, for de- 
55 termining a congestion state of the switch based on the arrival rates of the first and second type packets and 

the queue length of the second queue. 

2. The system of claim 1 wherein the first measurement module further comprises a buffer manager coupled to the 
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first buffer and the wavelet transform processor, the buffer manager including 
means for monitoring the length of the first queue; and 

means for outputting a signal if the queue length exceeds a predetermined threshold; 

5 

and wherein the wavelet transform processor determines an effective arrival rate further in response to the signal. 

3. The system of claim 1 wherein the wavelet transform processor includes means for separating the sampled arrival 
rate into a high-frequency component and a low-frequency component. 

10 

4. The system of claim 3 wherein the separating means includes means for implementing a discrete wavelet trans- 
form. 

5. The system of claim 3 wherein the separating means includes means for isolating the high-frequency component 
15 based on a second predetermined threshold. 

6. The system of claim 5 wherein the second predetermined threshold is based on a measure of the buffer size. 

7. The system of claim 5 wherein the isolating means includes means for implementing an inverse discrete wavelet 
20 transform. 

8. The system of claim 3 wherein the wavelet transform processor further includes means for filtering the high-fre- 
quency component from the sampled arrival rate. 

25 9. The system of claim 1 wherein the flow control module modifies a second type packet based on the switch con- 
gestion state. 

10. The system of claim 1 wherein the network is an ATM network, the packets of a first type comprise CBR and VBR 
packets, and the packets of a second type comprise ABR packets. 

30 

11. A method for controlling congestion in a network, the network including a network switch having a first buffer for 
receiving packets of a first type into a first queue and a second buffer for receiving packets of a second type into 
a second queue having a queue length, the method comprising the steps of: 

35 measuring an arrival rate of the packets of the first type, including the steps of 

sampling the arrivals of the first type packets; and 

determining an effective arrival rate of the first type packets in response to the sampled arrivals using 
wavelet transforms; 

40 

measuring an arrival rate of the packets of the second type; and 

determining a congestion state of the switch based on the arrival rates of the first and second type packets 
and the queue length of the second queue. 

45 12. The method of claim 11 wherein the step of measuring the arrival rate of first type packets further comprises the 
steps of 

monitoring the length of the first queue; and 

generating a signal if the queue length exceeds a predetermined threshold; 

so 

and wherein the step of determining the effective arrival rate of the first type packets is further in response to the 
signal. 

13. The method of claim 11 wherein the step of determining an effective arrival rate of the first type packets includes 
55 the step of separating the sampled arrival rate into a high-frequency component and a low-frequency component. 

1 4. The method of claim 1 3 wherein the separating step includes the step of implementing a discrete wavelet transform. 
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15. The system of claim 13 wherein the separating step includes the step of isolating the high-frequency component 
based on a second predetermined threshold. 

16. The system of claim 13 wherein the separating step includes the step of isolating the high-frequency component 
5 based on a measure of the buffer size. 

17. The system of claim 15 wherein the isolating step includes the step of implementing an inverse discrete wavelet 
transform. 

10 18. The system of claim 13 wherein the determining step further includes the step of filtering the high-frequency com- 
ponent from the sampled arrival rate. 

19. The method of claim 11 further comprising the step of modifying a second type packet based on the switch con- 
gestion state. 

is 

20. The method of claim 11 wherein the network is an ATM network, the packets of a first type comprise CBR and 
VBR packets, and the packets of a second type comprise ABR packets. 

20 
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